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Unambiguously distinguishing between nonorthogonal but linearly independent quantum states is 
a challenging problem in quantum information processing. In principle, the problem can be solved 
by mapping the set of nonorthogonal quantum states onto a set of orthogonal ones, which then can 
be distinguished without error. Such nonunitary transformations can be performed conditionally on 
quantum systems; a unitary transformation is carried out on a larger system of which the system 
of interest is a subsytem, a measurement is performed, and if the proper result is obtained, the 
desired nonunitary transformation will have been performed on the subsystem. We show how to 
construct generalized interferometers (multiports) , which when combined with measurements on 
some of the output ports, implement nonunitary transformations of this type. The input states are 
single-photon states in which the photon is divided among several modes. A number of explicit 
examples of distinguishing among three nonorthogonal states are discussed, and the networks that 
optimally distinguish among these states are presented. 
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I. INTRODUCTION 

The time evolution of a closed quantum system is uni- 
tary, hence scalar products (angles between two quantum 
states) are conserved. However, when measurements are 
made on the system, it is possible to perform prescribed 
non-unitary operations with a certain probability of suc- 
cess. In particular, it is possible to alter the value of 
scalar products and, hence, the angle between state vec- 
tors. Such operations can be used to unambiguously dis- 
criminate among non-orthogonal quantum states. A set 
of non-orthogonal states is mapped onto a set of orthog- 
onal ones, and the orthogonal states can be distinguished 
without error. According to the quantum theory of mea- 
surement, such a non- unitary transformation will always 
have a certain probability of failure, which, for the dis- 
crimination of non-orthogonal states, corresponds to the 
probability that we obtain inconclusive answers. Our aim 
here is to find the optimum solution that minimizes the 
average probability of failure. 

Considerable work has been done on this problem. The 
simplest case, distinguishing two nonorthogonal states 
was first considered by Ivanovic pj], and then subse- 
quently by Dieks Q and Peres ||. These authors found 
the optimal solution when the two states are being se- 
lected from an ensemble in which they are equally likely. 
The optimal solution for the situation in which the states 
have different weights was found by Jaeger and Shimony 
Q . One can also consider what happens if the discrimi- 
nation is not completely unambiguous, i. e. if it is possi- 
ble for errors to occur, and this was done by Chefies and 
Barnctt [|. The case of three states was examined by 
Peres and Terno ||. The general iV-state problem has 
been studied by Chefies j7j, by Chefies and Barnett ||, 
and by Duan and Guo |l . Chefies and Barnett employed 



the POVM formalism and specifically solved the case in 
which the probability of the procedure succeeding is the 
same for each of the states. Duan and Guo considered 
general unitary transformations and measurements on a 
Hilbert space containing the states to be distinguished 
and an ancilla, which would allow one to discriminate 
among N states, and derived matrix inequalities which 
must be satisfied for the desired transformations to exist. 

For the experimental realization of quantum informa- 
tion processing, one must choose a physical system to 
represent a qubit. Some possibilities which have been 
used are energy levels of ions, the orientation of a nuclear 
spin, and the presence or absence of a photon in a cav- 
ity jl(|- jn|. Another possibility, the so-called dual-rail 
representation of a qubit, was proposed by Milburn Jl4| , 
and later by Chuang and Yamamoto [ p^|JT^ ] . A photon is 
split between two modes which represent and 1. When 
the elementary carriers of the information are more than 
two dimensional objects (qutrits,. . ., qunits in general, 
for n dimensions), one needs a more general representa- 
tion. Here we will show that single photon states can 
be used to represent general non-orthogonal states in n 
dimensions, and how this representation can be used for 
state discrimination. A photon is now divided among n 
modes which represent the numbers 0, 1, . . . , n — 1. The 
method is a straightforward generalization of the dual 
rail representation of a qubit for more than two dimen- 
sions, and can be called the multiple rail representation 
of a qunit. An optical multiport, a kind of a generalized 
interferometer with more than two inputs and outputs, 
together with measurements made by the photon detec- 
tors placed at some or all of the output ports, can condi- 
tionally realize the desired non-unitary transformations 
of the initial, non-orthogonal single-photon states into or- 
thogonal states. Our previous paper JTg[ ] which proposed 
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an optical realization to optimally discriminate between 
two non-orthogonal states is a special case of the method 
presented here. 

Optical experiments to distinguish between two quan- 
tum states have already been carried out, first by Hut- 
tner, et. al. jlTj and, more recently, by Clarke, et. al. p8[ . 
Both of these used the polarization states of photons to 
represent qubits. 

This paper is divided into six sections. In Sec. II, 
we present a method for calculating the optimum proba- 
bilities of unambiguous discrimination between linearly- 
independent, non-orthogonal states. In Sec. Ill, the gen- 
eral properties of a quantum system, which realizes the 
optimum non-unitary transformation, are found by as- 
suming that the optimum probabilities are known. In 
Sec. IV, we will show how an optical multiport, which is 
designed to perform a particular unitary transformation, 
together with measurements at its output ports can real- 
ize non-unitary transformations of non-orthogonal input 
states represented by single photon states. Reck et al. 
p0{ gave a method to decompose any multiport into a 
series of beam splitters, phase shifters, and mirrors, and 
we will use this method to construct the desired mul- 
tiport. The general method is then illustrated by ap- 
plying it to qutrits in Sec. V. Examples are presented 
for the realization of transformations that convert three 
specific non-orthogonal states to orthogonal ones with a 
maximum probability of success. A brief discussion and 
conclusions are given in Sec. VI. 



II. OPTIMAL PROBABILITIES FOR 
UNAMBIGUOUS DISCRIMINATION AMONG 
NON-ORTHOGONAL QUANTUM STATES 

Suppose we are given a quantum system prepared in 
the state which is guaranteed to be a member of the 
set of non-orthogonal states {IV'i): IV^), • ■ ■ , IVVi)}, but we 
do not know which one. We want to find a procedure 
which will tell us which member of the set we were given. 
The procedure may fail to give us any information about 
the state, and if it fails, it must let us know that it has, 
but if it succeeds, it should never give us a wrong answer. 
We shall refer to such a procedure as state discrimination 
without error. Note that this procedure has n + 1 out- 
comes; it either tells us which state we were given, or it 
tells us that it failed (inconclusive outcome). 

In order to achieve error-free discrimination, Chefles 
has shown, in a very clear analysis of the problem, that 
the set 1^1)7 \i , 2)i ■ ■ ■ , \ip n ) must be linearly independent 
0. If the states are not orthogonal (which we shall 
assume), they cannot be discriminated perfectly. That 
means that if we are given \ipi}-, we will have some proba- 
bility pi to distinguish it successfully and, correspond- 
ingly, some failure probability qi = I — pi to obtain 
an inconclusive answer. Denote by Ji the Hilbert space 
spanned by the initial states \ifot), • • • 1 1VVi)}- Since 



there is a chance to get an inconclusive answer, the num- 
ber of outcomes of this process is larger than the dimen- 
sion of Tt, hence this process is a "generalized measure- 
ment" which can be represented by a set of operators 
which form a resolution of the identity , 



(2.1) 



where Ai is the operator that corresponds to the out- 
come \ipi), and Aj is the operator that corresponds to 
the inconclusive outcome. In more detail, if p is the den- 
sity matrix of our given state, then the probability of 
obtaining the fcth outcome, where k can be 1, ... n or 7, 
is pk — Tr{pA\Ak) and if the outcome is k, then the 
resulting density matrix is AkpA\/pk- The requirement 
that the discrimination be error free implies that 



{tp^A^Ak^i) = pidik. 



(2.2) 



From this, by an application of the Schwarz inequality, 
it follows that 



{ip k \Ai^i\tpj) ^Pi^jiSjk, 



(2.3) 



If we denote by rji the a priori probability that the 
system was prepared in the state \ipi), the average prob- 
abilities of success and of failure to distinguish the states 
\ipi) are, respectively, 



p = ^2 viPi 

i 

Q = X] ^ 



(2.4) 



Our objective is to find the set of {pi} that maximizes 
the probability of success, P, and the set of operators Ak 
that realize the corresponding generalized measurement. 
Define the failure state, \<fii) as 



|&) = A z |Vi>. 



(2.5) 



This is the state of the system if the input state was \tpi) 
and the outcome was inconclusive. Chefles |jj showed 
that the states are linearly dependent when P is a 

maximum. The interpretation of this result is the follow- 
ing. Because only linearly independent states can be dis- 
criminated without error, the operator corresponding to 
the inconclusive outcome maps the set of linearly depen- 
dent states {IV'i)} onto a linearly-dependent set, which 
then cannot be unambiguously discriminated by any fur- 
ther process. As we shall see, however, this does not 
mean that some information cannot be extracted from 
an inconclusive result. 

Now consider the inner product (cf>k\4>j), an d de fin e the 



matrix C by Cy = {4>k\4>j). Using equations (2.1), (2.5) 
and ( 2^2 ) , we find 



(2.6) 
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The matrix C is positive-semidefinite. This can be seen 
by noting that for any n-dimensional vector, whose com- 
ponents we shall denote by hi, where i=l,...n, 

n n 

J2 ^A = ll£WII 2 >o. (2.7) 

When the pi are equal to their optimal values, i. c. the 
values which maximize P, the linear dependence of the 
\<f>i) implies that 



det(C) = 0, 



(2.8) 



so that C has at least one zero eigenvalue when P is a 
maximum. 

In conclusion, the optimum probabilities pi can be 
found by maximizing P subject to the following con- 
straints: 

i) det(C) = 0, 

ii) C is non-negative, or, equivalently, all of the prin- 
cipal minors of C are non-negative. 

When we consider the case of just two non-orthogonal 
states, the above result immediately gives the following 
relationship between any two failure probabilities: 



qi-q 2 = | (^1^2) | 5 



(2.9) 



This is the same result that was obtained in our previ- 
ous paper |19|. In particular, when the two states have 
equal a priori probabilities, r/i = 772 = 5, we found the 
maximum probability of success to be, 



p=i-\(ihm\- 



(2.10) 



This is the well known Ivanovic-Dieks-Peres (IDP) limit 

§-!■ 



III. REALIZATION OF GENERALIZED 
MEASUREMENT 

Once wc know the set of optimum discrimination prob- 
abilities {pi}, we would like to find a realizable experi- 
mental procedure to achieve it. We shall do this first 
abstractly, and then show how it can be realized by lin- 
ear optical elements. Let us first summarize the proce- 
dure, and subsequently fill in the details. We begin with 
a total Hilbert space fC, which is the direct sum of two 
subspaces, tC — Ti® A. The space 7i is an n-dimensional 
space that contains the vectors and A is the space 
that will contain the failure vectors \4>i)- We shall denote 
the dimension of A by m. The input state of the system 
is one of the vectors \tpi), which is now a vector in the 
subspace H. of the total space IC. A unitary transforma- 
tion, U, which acts in the entire space JC is now applied 
to the input vector, resulting in the state \tpi)out- A 



measurement is performed on the part of 1^ )out in A. 
and, if the proper result is obtained, the vector } out 
is projected onto the vector \e^), which lies in the sub- 
space TL. The probability of this occuring is pi. The 



vectors {| 



,H\ 



1, . . .«} are orthonormal and can be 



distinguished perfectly. The effect of the unitary trans- 
formation on an extended space and the measurement 
is to map a set of non-orthogonal vectors onto a set of 
orthogonal ones. 

We now need to specify U and the measurement, and 
let us discuss the latter first. The measurement has two 
outcomes, one of them corresponding to the operator, 
Pu, which projects onto the subspace 7i, and the other 
to the operator P4 = I—Ph, which projects onto the sub- 
space A. The first outcome corresponds to the successful 
transformation of \ipf')out into |e^), and its probability 
of occurence is pi. This implies that 



(3.1) 



where |</>^) is a failure state, and we have added a su- 
perscript A to denote the fact that it is in the subspace 
A. The other outcome corresponds to obtaining an in- 
conclusive answer and transforms \^) ut into 

Eq. ( |3.l[ ) and the fact that U is unitary implies that 



) = (W\W)-Pj6 jk , 



(3.2) 



which is just Eq. (2.6). If (and only if) the matrix C is 
non-negative, we can always find vectors \4»f) that satisfy 
this equation. This follows from the fact that a non- 
negative matrix can be written as the product of a matrix 
and its adjoint, in particular, we can express C as 



C = A^A, 



(3.3) 



for some matrix A. If we define \cj>j) ~ A\j), where 
\j) is the vector whose jth component is one and all 
of whose other components are zero, then we have that 
Cjk = {4>j\4>k) ■ Once we have found these vectors and 
specified the vectors |e^), then the operator U can be 



found by means of Eq. (3.1). These conditions may not 
completely determine U\ if they do not, then there is 
freedom in choosing it. This will be the case if the di- 
mension of A is greater than one. U maps vectors in A 
to vectors in the subspace, S, of JC that consists of the 
vectors that are orthogonal to all of the vectors lip^out- 
The dimension of S is m. T he f reedom in choosing U 
comes from the fact that Eq. (3.1) does not specify how 
A is mapped into iS. If both are one-dimensional, then 
the mapping is determined (up to an overall phase), but 
if their dimension is greater than two it is not. 

Once the measurement and the operator U have been 
specified, our realization of the generalized measurement 
is completely determined. The next task is to find a 
physical system with which to implement it. 



3 



IV. OPTICAL REALIZATION OF NON-UNITARY 
TRANSFORMATION 



N 



(4.4) 



We now want to propose an experimental procedure to 
achieve our non-unitary transformation by using optical 
devices. We shall show how this can be accomplished 
by using a single-photon representation of the states \tpi) 
and an optical multiport together with photodetectors 
at the output ports to carry out the desired non-unitary 
transformation. 

Our Hilbert space will consist of a single photon, which 
is divided among n + m modes. The modes themselves 
could be distinguished by having different wave vectors 
or they might be modes of different optical fibers. A 
basis for this space consists of the single photon states 
{<4|0)|j = l,...n + to}, where |0) is the vacuum state 



and al is the creation operator for the jth mode. The 
states {aj|0)|j = 1, . . . n} form a basis for the space H, 

and the states {a] |0) \j = n + 1, . . . n + m} form a basis for 
the space A. The initial states \ipi) can be represented 
as single photon states in Ti., which can be written as 



a]\0), 



(4-1) 



where we have chosen the states |e^) to be \e¥) = a] \0 n ) ■ 
An optical 27V-port is a lossless linear device with N 
input ports and N output ports. Its action on the input 
states can be described by a unitary operator, U 2 n, and 
physically it consists of an arrangement of beam split- 
ters, phase shifters, and mirrors. Choosing N = n + to, 
we send the single photon state into the first n input 
ports, which correspond to Ti, and the vacuum into the 
remaining m input ports, which correspond to A. Pho- 
todetectors are placed at the last m output ports (the 
ones corresponding to A) , and if there is no photon de- 
tected, the desired non-unitary transformation will have 
been carried out. In particular, with } out — U 2 N\ipi)) 
where ) out is given by Eq. (3.1), the action of the 



measurement, if successful, is to project the output state 
onto and the probability to achieve this is pi. 

If we denote the annihilation operators correspond- 
ing to the input modes of the 2./V-port by Oji n , j — 
1,2, ... ,N, then the output operators are given by 

N 

a jout = U 2J ^aj in U2N = *^^Mjk a kin, (4-2) 
fe=l 

where Mjh are the elements of an N x N unitary matrix 
M. In the Schrodinger picture, the in and out states are 
related by 

\il> K )out = U 2N \i> K ) in . (4.3) 
In general, for an in state that contains a single photon 



where Ylj=i \ c j\ 2 = 1> the out state is given by 

H K )out = U 2N \^' C )in 
N 

3=1 

N 
3,k=l 



(4.5) 



Note that we have made use of the fact that the vacuum 
is invariant under the transformation, L^at. This implies 
that the matrix elements Mu is the same as the matrix 
element of U 2 n between the single-particle states \i) — 
a| in |0) and |Z) = aj in |0). Choosing Cj — 5ji in the above 
equation and then taking the inner product of the result 
with \i), we find that 



(i\U 2N \l) = M a . 



(4.6) 



The desired matrix M can be found from Eq. (3.1), and 
our next task is to decompose it in such a way that it 
corresponds to a linear optical network. 

This problem has been solved by M. Reck et al. [ p(i[ , 
and we shall summarize their method. They gave an 
algorithmic procedure to factorize any N x N unitary 
matrix into a product of two-dimensional t/(2) transfor- 
mations, and it is this procedure that we shall adopt here 
to construct our 2 Alport, which is characterized by the 
matrix M of equation (4.2). 

It is well known that a lossless beam splitter and a 
phase shifter with appropriate parameters can implement 
any U{2) transformation; a beam splitter with a phase 
shifter at one output port transforms the input operators 
into output operators as 



ai 
a 2 



e^sinw e 1 ^ cosw 



a 2 



(4.7) 



where a±, a 2 are the annihilation operators of modes 1 
and 2 respectively. In their paper, Reck, et al. consid- 
ered the use of a Mach-Zchnder interferometer to simu- 
late the effect of a beam splitter that does not split the 
incoming beam equally, in which case ui describes the 
reflectivity and transmittance of the effective beam split- 
ter with >/R = sinw, = cosw, and cj) describes the 
effect of the phase shifter. If the matrix describes an ac- 
tual beam splitter, then \/R = cosw, and VT = SVD.LJ. 
Any N x N unitary matrix U (N) can be reduced to an 
(JV - 1) x (N - 1) unitary matrix, U(N - 1), by multi- 
plying from the right by a succession of two-dimensional 
unitary matrices 



U(N) ■ R(l) 



U(N-l) 



(4.8) 
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Here R(l) — T\ t 2 ■ ?\,3 • • ■ Ti,Ar, and T p , q is denned as 
an TV-dimensional identity matrix with elements I pp , I pq , 
I qp , I qq replaced by the corresponding elements of a U(2) 
matrix. It performs a unitary transformation on a two- 
dimensional subspace of the full N dimensional space, 
and can be implemented by attaching a beam splitter 
and a phase shifter to ports p and q. 

We can repeat the above transformation, decreasing 
the dimension of the remaining unitary matrix by one at 
each step. Applying this procedure to the matrix M of 
equation ( |t.2| )) we have that 

M ■ R(l) ■ R(2) ■ ■ ■ R(n + to) 



M 1 



( 







e 1 ' 




\ 



^ 



(4.9) 



Denoting by D{a\, ct2 ■ ■ ■ a n ) the diagonal matrix 



D = 



( e- lai 

e -ia 2 





-4as n _j- OT 



(4.10) 



we have 



M ■ R{1) ■ R{2) ■ ■ ■ R(n + to - 1) • D = 1, (4.11) 



i.e. 



M = D^ 1 ■ R(n + m - 1)~ 



R(1Y 



(4.12) 



Since the product of matrices is equival ent to setting 
up experimental devices in sequence, Eq. ( |4.12D implies 
that to get M, the actual experimental setup is made of 
a series of U(2) blocks to achieve i?(n + m) _1 • • • i?(l) _1 , 
and n + m appropriate phase shifters attached to the 
output ports to produce D . Figure 1 gives a picture 
of the practical implementation of M. 

It is possible to save some steps by modifying this pro- 
cedure. As was mentioned earlier, the matrix M is not al- 
ways completely determined. In particular, there is free- 
dom in choosing the matrix elements Mjk for k > n. Let 
us now see what happens if we apply the procedure of 
Reck, et al. to the transpose of M, M T , instead of M 
itself. It is now the matrix elements (M T )jk, for j > n 
that are not completely determined, and we shall leave 
them that way for now. The matrixes making up i?(l) 
are chosen to make all of the elements, except the first, 
of the the first row of M T R(1) zero. In finding each of 
the matrixes Ti q , we only need to use the matrix ele- 
ments that are completely determined (this is not true if 
we start with M instead of M T ). Now if the first row 
of M T R(l) is zero except for the first element, then by 
unitarity, the first column is also zero, except for its first 
element. Continuing in this way we have that 



R(l) • R(2) ■ ■ ■ R(n) 



t 










\ 







e M2 
















\ 








M m 


) 



(4.13) 



where M m is an m x m unitary matrix that contains the 
information about the matrix elements in M that are not 
completely specified. At this point, we can choose M m 
to be any unitary matrix, and the simplest choice is the 
m x m identity matrix, I m . Defining 



D' = 



V 










\ 

Im J 



(4.14) 



we have that 

M = [R(l)R(2)---R(n)D'Y 



(4.15) 



V. APPLICATION TO THREE STATES AND 
EXAMPLES 

In this section we first apply the above considerations 
to the problem of realizing optimal discrimination among 
three non-orthogonal but linearly-independent quantum 
states, in general. Then we illustrate the method on spe- 
cific examples. For simplicity, we assume that the a priori 
probabilities are all equal, 771 — 772 — 773 = 1/3. 

From Eq. (2.4), the probability of failure is 



Q 



1 3 



(5.1) 



The requirement of the linear dependence of the \4>i) vec- 
tors (i — 1,2,3) leads to the constraint given by Eq. (fEjj) ). 
For the case of three vectors it can be written as 

A = det(C) 

= <7i<72<73 - <7i|0 2 3| 2 - 92|Oi 3 | 2 - q 3 |Oi 2 | 2 

+O12O23O13 + o* 12 o; 3 o 13 

= 0, (5.2) 

where O i3 = {ipi\ipj). 

Employing the Lagrange multiplier method, we wish 
to minimize the quantity 



(5.3) 



which immediately leads to the conditions 



5 



oqi 3 
dQ' _ 1 
#92 3 

go; _ i 

dq 3 3 



AA 13 = 0, 
AA 12 = 0, 



(5.4) 



where A is a Lagrange multiplier, and Aj.2, A13, A23 are 
subdeterminants of C, A i2 = 91(72 — |Oi2 1 2 , etc. Equation 
(O) implies that 



A12 



n3 



1 

3A' 



(5.5) 



This means that all three subdeterminants are equal. Let 
S = — -j denote this common value and recall that all 
subdeterminants of C must be non-negative, so that 5 > 
0. 



From Eq. (5.5) we can solve for the q^s, yielding 



9i 



92 = 



93 



(\Ol2 


2 + <5)(|0 13 | 2 


+ 5) 


(\0 23 \ 2 + 5) 


(|0 12 


2 + <5)(|0 23 | 2 


+ 6) 


(|0 13 | 2 + <5) 


(|Ol3 


2 + <5)(|o 23 | 2 


+ S) 


(|Oi 2 | 2 + <5) 



(5.6) 



Finally, we can substitute Eq. (^J) into Eq. ( |5.2j ) to solve 
for 5 and then use the above equations to find the corre- 
sponding qi values. When we solve for 5, there are often 
a number of different solutions. However, we need only 
consider solutions that are greater than or equal to zero, 
and which give values of qi that are between and 1. If 
there are several solutions that satisfy these conditions, 
we must determine which one gives the actual minimum. 
If there are none, then we must examine the boundary 
of the allowed region to find the minimum. The point 
(91, 92, 93) lies inside or on the surface of a unit cube one 
whose vertices lie on the points (j, k, I), where j, k, I = 
or 1. If the Lagrange multiplier approach does not yield a 
valid solution the minimum of Q subject to the constraint 
A = must lie on the surface of the cube. 

Note that if the overlaps are real and positive, a sit- 
uation we shall con sider shortly, then 6 = is always a 
solution of Eq. ( |5.2|) . In this case, if all the corresponding 
qi for S = are between and 1, then this set of {qi} 
is a possible solution to our problem, i.e. a minimum of 
Q that satisfies A = 0. If it is, in fact the solution, we 
see that A12 = A13 = A23 = 0, which implies that each 
possible pair of the states |</>i), i — 1,2,3 is linearly de- 
pendent, so that all three states <f>i are in a line, i.e. the 
dimensionality of the auxiliary Hilbert space A is one. 
If the solution to the problem is one for which 8 > 0, 
no pair of failure states is linearly dependent. However, 
the three failure states together are linearly dependent, 



so that in this case the dimensionality of the auxiliary 
Hilbert space A is two. 

Next, we shall consider specific examples involving 
three non-orthogonal but linearly independent state vec- 
tors, to illustrate the general considerations of the previ- 
ous sections. In particular we want to determine explic- 
itly the parameters and dimensionality for the special 
multiports that optimally discriminate among the three 
quantum states. For simplicity, we shall assume that the 
a priori probabilities are equal in all of our examples. 

Our first case is a simple one; the overlaps of the three 
states will be assumed to be real and equal 



(5.7) 



where < s < 1. The constraint of equation ( pTg| ) is, in 
this case, 



9i9293 



s 2 H 



2s 6 = 0, 



(5.8) 



application of the the Lagrangian multiplier method im- 
plies that qi = q 2 = 93, and that 



o 2 
3s qt 



2s 6 = 0. 



(5.9) 



This equation has two solutions, 9; = s, — 2s, of which 
only qi — s is valid. This solution is a minimum and it 
implies that the optimal value of the total failure proba- 
bility is Q = s. 

Our next step is to find the failure vectors. For any 
3x3 positive matrix, L, we find that we can express its 



matrix elements as Ly = 

\<h)-- 

102) = 



103 



if 




(5.10) 



where Ai 2 = L11L22 ~ |ii 2 | 2 and A = detL. Applying 
this to the matrix C, with 9; = s, i = 1, 2, 3, we find that 
the three failure vectors are identical, they all have mag- 
nitude i/s and point in the same direction. Therefore, 
our failure space, A, is one dimensional, the full Hilbert 
space K, = Ti. © A is four dimensional, and we will need 
an eight port to accomplish our unitary transformation. 

In order to find the necessary unitary transformation, 
we must first specify our input states. Let us choose our 
three states to be (in the full space, JC = H (B A) 
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Vl+2s 






^3 
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G 



IV£}in 
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V6 
V3 
V2 





(5.11) 



/ 



where ipi are represented by single photon states. One 
can verify that (V'xl^) = (V^IV^s) = (ifalfi) = s. The 
output states can be found from Eq. ( |3.1[ ), and are ex- 
plicitly given by 



\^)out 



\^f)out 



VT—s 



V~s 





2 lout 




(5.12) 



The unitary transformation, U, maps the input states 
onto the output states, i.e. \ipf) ut = U\^pf)im for 
i = 1,2,3. In addition, it must map the vector that 
is orthogonal to the three input vectors onto the vector 
that is orthogonal to the three output vectors, 



\/27TT 



= u 



(5.13) 



The action of U on these four vectors completely deter- 
mines it, and we find that it is given by the matrix M(4), 
which is 



where we shall assume, for simplicity, that s% and S2 are 
real and between and 1. For a fixed value of si there is a 
restriction on how large S2 can be. The largest the angle 
between ip2 and ip3 can be is twice the angle between ip\ 
and ip2 (this maximum is achieved when the vectors are 
coplanar). This implies that S2 > 2sf — 1. Application 
of the Lagrange multiplier method to the minimization 
of Q' gives us q2 = (73 and 



Qi = 



?! 



<12 



(5.17) 



Substituting these results into the constraint equation 
and defining y — 52/^2 and (3 — s\/ s 2 , we have 



y 



(2 + f3 2 )y 2 + 2/3 2 y + l-(3 2 = 0. 



(5.18) 



The roots of this equation are y = 1, — 1 ±/3, and two of 
them 1 and (3—1 yield valid solutions, the latter if (3 > 1. 
Substitution of these results into Q shows that if (3 < 2, 
then the solution y — 1 gives the minimum and if [3 > 2, 
then y = (3— 1 gives the minimum. Summarizing, we find 
that if (3 < 2, the minimum value of Q is [(sf/s2)+2sa]/3 
and (solution 1) 



<7i = — 

S2 

q-2 = q3 = S2 , 



(5.19) 



and if (3 > 2, then the minimum value of Q is 2(2si — S2) /3 
and (solution 2) 



qi = 2s 1 

92 = <73 = si - s 2 . 



(5.20) 



M(4) = 



V 
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1 
1 
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3(2s+l) 



l-s 
3(2s+l) 

3s 



2s+l 
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1 





2s+l 



2s+l 



(5.14) 



Using the method described in Sec. IV, M(4) can be 
factorized as 



M (4) = Ti, 2 • Ti, 3 • T 2 , 3 • T 2 , 4 , 



(5.15) 



where the parameters that determine the matrixes T pq 
are given in table 1 (this example is referred to as case 1). 
Note that because these matrixes are real, the complex 
conjugate, which appears in Eq. (4.15) is unnecessary. 

Now let us consider a more general case than the one 
we have been studying so far. We shall assume that two 
of the overlaps are the same and the third is different, in 
particular that 



(V'iI^) 
(^2^3) 



(^3) 

S2 



Si 



(5.16) 



Clearly, for these solutions to be valid, all of the proba- 
bilities have to be between and 1. 

The next step is to find the failur e vec tors. If solution 
1 is the valid one, we find from Eq. (5.10) that the failure 



space is one dimensional, and if 
basis vector for this space, then 



Si 



S2 

\44) 



S2\U?) 



is the normalized 



(5.21) 



If solutions 2 is the valid one, then the failure space is 
two-dimensional. If \v£) where j = 1,2 is an orthonor- 
mal basis for this space we find that 



\4>i 



8 2 \v£) 



a 2 \v£) 



(5.22) 



Let us look at an example of each solution. If we choose 
our three states to be ipi — (1, 0, 0), ^2 = ^ (L 1, 1) and 
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-03 = ^§(1) 1' — 1)j we nn d that si = 1/V3 and s 2 = 1/3, 
so that solution 1 is valid. The complete treatment of this 
case (case 2) is given in Table 2. We see that we need 
an eight-port which can be built up by two U(2) blocks. 
Note that in order to achieve minimum failure probability 
Q, we need to choose q\ to be 1, which means that we 
sacrifice the possibility of distinguishing state 

If we choose our states to be ipi = (1,0,0), ip2 = 
|(1,2,2) and tp 3 = §(1,2,-2), then we find that solu- 
tion 2 is valid with si — 1/3 and s 2 = 1/9- In this case 
(case 3) we need a ten-port, and the complete results are 
given in Table 3. Note that if the procedure fails, it is still 
possible to gain some information about the input state, 
because the failure space is two-dimensional ||. This is 
not possible if the failure space has only one dimension. 

One possibility is to attach to the failure-space outputs 
(outputs 4 and 5) a network that transforms states l^rf) 
and \4>^} into orthogonal states, which it will do only with 
a certain probability |l9f| . In particular, we can construct 
a network that implements the transformation 



M(3) = 




(5.23) 



where the inputs to the first two ports of this network (wc 
shall call these ports A and B) are the outputs of ports 4 
and 5 of the original network, and the input to the third 
port (port C) is the vacuum. This network has been de- 
signed so that if no photon is detected emerging from 
output C, then the input state will be transformed 
into a photon emerging from port A, and the input state 
jc^g 1 ) will be transformed into a photon emerging from 
port B. If the input state is \4>i~) and no photon is de- 
tected at output C, the probabilities of a photon emerging 
from either port A or port B are the same. Therefore, 
if the photon emerges from port A, we can conclude the 
input to the entire network was either -01 or ip2 , and if it 
emerges from port B, then the input was either tpi or tp 3 . 
Summarizing, if one of the detectors in ports 1 through 3 
clicks, we know what the input state was. If the detector 
in either ports A or B clicks, then we gain partial infor- 
mation about the input state; the number of possibilitcs 
has been reduced from three to two. If the detector in 
port C clicks, then we have gained no information about 
the input state, and this happens with a probability of 
1/9 if the inputs were ip2 or ^3 and 4/9 if the input was 
■01. The addition of the second network to the failure 
outputs of the first significantly improves the chances of 
gaining some information about the input state. 



VI. CONCLUSIONS 

We have shown that nonorthogonal quantum states, 
each realized as a photon split among several modes, can 



be conditionally distinguished by means of a linear opti- 
cal network. For three states we have given explicit net- 
works, which give the maximum success probabilities for 
several sets of states. In addition, it was shown that the 
addition of a second network to the outputs correspond- 
ing to a failure of the initial network to distinguish the 
states, can sometimes provide partial information about 
the input state. We believe it should be possible to con- 
struct these networks in the laboratory. 
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FIG. 1. The implementation ol an optical multiport that 
perfo rms the unitary transformation M(N) described in Eq. 
(4.12). The beams are straight lines, a suitable beam splitter 



is at each crossing point of the first n diagonal lines, phase 
shifters are at one input of each beam splitter and at the first 
n outputs. Each diagonal line of the multiport reduces the 
dimension of M(N) by one. 



TABLE I. Arguments of T PtQ for case 1. The range of arc- 



cos is the interval [0, ■k 






UJ 


4> 


Tl,2 


arccos [__i_] 





Tl,3 


arccos [__i_] 





^2,3 


- arccos[-y /r |] 





2^2,4 


arccos[^_] 










TABLE II. Summary of parameters and arguments of T p>q 
for case 2. 





case 2 


Input states 




01 

!!) 




V'2 
(?) 

V / 




V'3 

Vs 

73 
V J 




Optimal failure 
probability 


91 = l 

92 = k 

93 = a 


Output states 




01 

(!! 


1 


V'2 

/ ^ 

pi 

V 3 



V 7S / 




V'3 

(I) 

V 73 ) 




M = 




/ 1 \ 

75 75 
7f -75 

V i o oo/ 




Factorization 
of M 


M = T M • T 2 , 3 


Arguments 

Of T p . q 


Ti 4 : w = 0, = 
r 2 ,3 :w = f,0 = O 



TABLE III. Summary of parameters and arguments of T p>q 
for case 3. 
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Input states 


tpi 
fl\ 
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2 
3 
2 
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/ 
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V'3 

' * \ 

2 

3 
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3 



. o y 


Optimal failure 
probability 


91 = | 

92 = | 

93 = | 


Output states 


01 




pi 

V 3 
V o / 


V'2 

( 

77 
3 



1 
76 

V 3 V2 , 


\ 
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V'3 

( ^ 



77 
3 
1 

V 3 72 / 


M 




/ J_ i o -<f^ \ 

n V7 7f 11 
u 4 4 4 4 
n 77 77 1 1 
u 4 4 4 4 

\fl ris \fh 

l 2 ^ >/iJ 




Factorization 
of M 


M — Ti.4 ■ T 2 .3 ■ T2,4 • ^3,5 


Arguments 
of T Piq 


Ti ; 4 : u) = arccos [1 

T 2 . 3 : u= f,0 = C 
7*2,4 ■ u) = arccos [ 

T 3>5 : arccos [ 2 ^] 


Ji 

,<f> 


],0 = O 

= 
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